Sherman
Abstract:Analog machine-learning hardware platforms promise greater speed and energy efficiency than their digital counterparts. Specifically, over-the-air analog computation allows offloading computation to the wireless propagation through carefully constructed transmitted signals. In addition, reconfigurable intelligent surface (RIS) is emerging as a promising solution for next-generation wireless networks, offering the ability to tailor the communication environment. Leveraging the advantages of RIS, we design and implement the ordinary differential equation (ODE) neural network using over-the-air computation (AirComp) and demonstrate its effectiveness for dual tasks. We engineer the ambient wireless propagation environment through distributed RISs to create an architecture termed the over-the-air ordinary differential equation (Air-ODE) network. Unlike the conventional digital ODE-inspired neural network, the Air-ODE block utilizes the physics of wave reflection and the reconfigurable phase shifts of RISs to implement an ODE block in the analog domain, enhancing spectrum efficiency. Moreover, the advantages of Air-ODE are demonstrated in a deep learning-based semantic communication (DeepSC) system by extracting effective semantic information to reduce the data transmission load, while achieving the dual functions of image reconstruction and semantic tagging simultaneously at the receiver. Simulation results show that the analog Air-ODE network can achieve similar performance to the digital ODE-inspired network. Specifically, for the image reconstruction and semantic tagging task, compared with the analog network without the Air-ODE block, the Air-ODE block can achieve around 2 times gain in both reconstruction quality and tagging accuracy.
Abstract:Electric Vertical Take-Off and Landing (eVTOL) aircraft, pivotal to Advanced Air Mobility (AAM), are emerging as a transformative transportation paradigm with the potential to redefine urban and regional mobility. While these systems offer unprecedented efficiency in transporting people and goods, they rely heavily on computation capability, safety-critical operations such as real-time navigation, environmental sensing, and trajectory tracking--necessitating robust offboard computational support. A widely adopted solution involves offloading these tasks to terrestrial base stations (BSs) along the flight path. However, air-to-ground connectivity is often constrained by spectrum conflicts with terrestrial users, which poses a significant challenge to maintaining reliable task execution. Cognitive radio (CR) techniques offer promising capabilities for dynamic spectrum access, making them a natural fit for addressing this issue. Existing studies often overlook the time-varying nature of BS resources, such as spectrum availability and CPU cycles, which leads to inaccurate trajectory planning, suboptimal offloading success rates, excessive energy consumption, and operational delays. To address these challenges, we propose a trajectory optimization framework for eVTOL swarms that maximizes task offloading success probability while minimizing both energy consumption and resource competition (e.g., spectrum and CPU cycles) with primary terrestrial users. The proposed algorithm integrates a Multi-Armed Bandit (MAB) model to dynamically estimate BS resource availability and a Monte Carlo Tree Search (MCTS) algorithm to determine optimal offloading decisions, selecting both the BSs and access time windows that align with energy and temporal constraints.
Abstract:The stacked intelligent metasurface (SIM), comprising multiple layers of reconfigurable transmissive metasurfaces, is becoming an increasingly viable solution for future wireless communication systems. In this paper, we explore the integration of SIM in a multi-antenna base station for application to downlink multi-user communications, and a realistic power consumption model for SIM-assisted systems is presented. Specifically, we focus on maximizing the energy efficiency (EE) for hybrid precoding design, i.e., the base station digital precoding and SIM wave-based beamforming. Due to the non-convexity and high complexity of the formulated problem, we employ the quadratic transformation method to reformulate the optimization problem and propose an alternating optimization (AO)-based joint precoding framework. Specifically, a successive convex approximation (SCA) algorithm is adopted for the base station precoding design. For the SIM wave-based beamforming, two algorithms are employed: the high-performance semidefinite programming (SDP) method and the low-complexity projected gradient ascent (PGA) algorithm. In particular, the results indicate that while the optimal number of SIM layers for maximizing the EE and spectral efficiency differs, a design of 2 to 5 layers can achieve satisfactory performance for both. Finally, numerical results are illustrated to evaluate the effectiveness of the proposed hybrid precoding framework and to showcase the performance enhancement achieved by the algorithm in comparison to benchmark schemes.
Abstract:The emergence of sixth-generation and beyond communication systems is expected to fundamentally transform digital experiences through introducing unparalleled levels of intelligence, efficiency, and connectivity. A promising technology poised to enable this revolutionary vision is the wireless large AI model (WLAM), characterized by its exceptional capabilities in data processing, inference, and decision-making. In light of these remarkable capabilities, this paper provides a comprehensive survey of WLAM, elucidating its fundamental principles, diverse applications, critical challenges, and future research opportunities. We begin by introducing the background of WLAM and analyzing the key synergies with wireless networks, emphasizing the mutual benefits. Subsequently, we explore the foundational characteristics of WLAM, delving into their unique relevance in wireless environments. Then, the role of WLAM in optimizing wireless communication systems across various use cases and the reciprocal benefits are systematically investigated. Furthermore, we discuss the integration of WLAM with emerging technologies, highlighting their potential to enable transformative capabilities and breakthroughs in wireless communication. Finally, we thoroughly examine the high-level challenges hindering the practical implementation of WLAM and discuss pivotal future research directions.
Abstract:Reconfigurable intelligent surface (RIS) is emerging as a promising technology for next-generation wireless communication networks, offering a variety of merits such as the ability to tailor the communication environment. Moreover, deploying multiple RISs helps mitigate severe signal blocking between the base station (BS) and users, providing a practical and efficient solution to enhance the service coverage. However, fully reaping the potential of a multi-RIS aided communication system requires solving a non-convex optimization problem. This challenge motivates the adoption of learning-based methods for determining the optimal policy. In this paper, we introduce a novel heterogeneous graph neural network (GNN) to effectively leverage the graph topology of a wireless communication environment. Specifically, we design an association scheme that selects a suitable RIS for each user. Then, we maximize the weighted sum rate (WSR) of all the users by iteratively optimizing the RIS association scheme, and beamforming designs until the considered heterogeneous GNN converges. Based on the proposed approach, each user is associated with the best RIS, which is shown to significantly improve the system capacity in multi-RIS multi-user millimeter wave (mmWave) communications. Specifically, simulation results demonstrate that the proposed heterogeneous GNN closely approaches the performance of the high-complexity alternating optimization (AO) algorithm in the considered multi-RIS aided communication system, and it outperforms other benchmark schemes. Moreover, the performance improvement achieved through the RIS association scheme is shown to be of the order of 30%.
Abstract:The recently proposed multi-chirp waveform, affine frequency division multiplexing (AFDM), is considered as a potential candidate for integrated sensing and communication (ISAC). However, acquiring accurate target sensing parameter information becomes challenging due to fractional delay and Doppler shift occurrence, as well as effects introduced by the coexistence of near-field (NF) and far-field (FF) targets associated with large-scale antenna systems. In this paper, we propose a novel angle-delay-Doppler estimation scheme for AFDM-ISAC system in mixed NF and FF scenarios. Specifically, we model the received ISAC signals as a third-order tensor that admits a low-rank CANDECOMP/PARAFAC (CP) format. By employing the Vandermonde nature of the factor matrix and the spatial smoothing technique, we develop a structured CP decomposition method that guarantees the condition for uniqueness. We further propose a low-complexity estimation scheme to acquire target sensing parameters with fractional values, including angle of arrival/departure (AoA/AoD), delay and Doppler shift accurately. We also derive the Cram\'er-Rao Lower Bound (CRLB) as a benchmark and analyze the complexity of our proposed scheme. Finally, simulation results are provided to demonstrate the effectiveness and superiority of our proposed scheme.
Abstract:Large language models (LLMs) face significant challenges in specialized domains like telecommunication (Telecom) due to technical complexity, specialized terminology, and rapidly evolving knowledge. Traditional methods, such as scaling model parameters or retraining on domain-specific corpora, are computationally expensive and yield diminishing returns, while existing approaches like retrieval-augmented generation, mixture of experts, and fine-tuning struggle with accuracy, efficiency, and coordination. To address this issue, we propose Telecom mixture of models (TeleMoM), a consensus-driven ensemble framework that integrates multiple LLMs for enhanced decision-making in Telecom. TeleMoM employs a two-stage process: proponent models generate justified responses, and an adjudicator finalizes decisions, supported by a quality-checking mechanism. This approach leverages strengths of diverse models to improve accuracy, reduce biases, and handle domain-specific complexities effectively. Evaluation results demonstrate that TeleMoM achieves a 9.7\% increase in answer accuracy, highlighting its effectiveness in Telecom applications.
Abstract:In this paper, we investigate reconfigurable pixel antenna (RPA)-based electronic movable antennas (REMAs) for multiuser communications. First, we model each REMA as an antenna characterized by a set of predefined and discrete selectable radiation positions within the radiating region. Considering the trade-off between performance and cost, we propose two types of REMA-based arrays: the partially-connected RPA-based electronic movable-antenna array (PC-REMAA) and fully-connected REMAA (FC-REMAA). Then, we formulate a multiuser sum-rate maximization problem subject to the power constraint and hardware constraints of the PC-REMAA or FC-REMAA. To solve this problem, we propose a two-step multiuser beamforming and antenna selection scheme. In the first step, we develop a two-loop joint beamforming and antenna selection (TL-JBAS) algorithm. In the second step, we apply the coordinate descent method to further enhance the solution of the TL-JBAS algorithm. In addition, we revisit mechanical movable antennas (MMAs) to establish a benchmark for evaluating the performance of REMA-enabled multiuser communications, where MMAs can continuously adjust the positions within the transmission region. We also formulate a sum-rate maximization problem for MMA-enabled multiuser communications and propose an alternating beamforming and antenna position optimization scheme to solve it. Finally, we analyze the performance gap between REMAs and MMAs. Based on Fourier analysis, we derive the maximum power loss of REMAs compared to MMAs for any given position interval. Specifically, we show that the REMA incurs a maximum power loss of only 3.25\% compared to the MMA when the position interval is set to one-tenth of the wavelength. Simulation results demonstrate the effectiveness of the proposed methods.
Abstract:Flexible-geometry arrays have garnered much attention in wireless communications, which dynamically adjust wireless channels to improve the system performance. In this paper, we propose a novel flexible-geometry array for a $360^\circ$ coverage, named flxible cylindrical array (FCLA), comprised of multiple flexible circular arrays (FCAs). The elements in each FCA can revolve around the circle track to change their horizontal positions, and the FCAs can move along the vertical axis to change the elements' heights. Considering that horizontal revolving can change the antenna orientation, we adopt both the omni-directional and the directional antenna patterns. Based on the regularized zero-forcing (RZF) precoding scheme, we formulate a particular compressive sensing (CS) problem incorporating joint precoding and antenna position optimization, and propose two effective methods, namely FCLA-J and FCLA-A, to solve it. Specifically, the first method involves jointly optimizing the element's revolving angle, height, and precoding coefficient within a single CS framework. The second method decouples the CS problem into two subproblems by utilizing an alternative sparse optimization approach for the revolving angle and height, thereby reducing time complexity. Simulation results reveal that, when utilizing directional radiation patterns, FCLA-J and FCLA-A achieve substantial performance improvements of 43.32\% and 25.42\%, respectively, compared to uniform cylindrical arrays (UCLAs) with RZF precoding.
Abstract:Typical reconfigurable intelligent surface (RIS) implementations include metasurfaces with almost passive unit elements capable of reflecting their incident waves in controllable ways, enhancing wireless communications in a cost-effective manner. In this paper, we advance the concept of intelligent metasurfaces by introducing a flexible array geometry, termed flexible intelligent metasurface (FIM), which supports both element movement (EM) and passive beamforming (PBF). In particular, based on the single-input single-output (SISO) system setup, we first compare three modes of FIM, namely, EM-only, PBF-only, and EM-PBF, in terms of received signal power under different FIM and channel setups. The PBF-only mode, which only adjusts the reflect phase, is shown to be less effective than the EM-only mode in enhancing received signal strength. In a multi-element, multi-path scenario, the EM-only mode improves the received signal power by 125% compared to the PBF-only mode. The EM-PBF mode, which optimizes both element positions and phases, further enhances performance. Additionally, we investigate the channel estimation problem for FIM systems by designing a protocol that gathers EM and PBF measurements, enabling the formulation of a compressive sensing problem for joint cascaded and direct channel estimation. We then propose a sparse recovery algorithm called clustering mean-field variational sparse Bayesian learning, which enhances estimation performance while maintaining low complexity.